Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 26
Filter
1.
Sci Rep ; 14(1): 8442, 2024 04 10.
Article in English | MEDLINE | ID: mdl-38600110

ABSTRACT

Using clustering analysis for early vital signs, unique patient phenotypes with distinct pathophysiological signatures and clinical outcomes may be revealed and support early clinical decision-making. Phenotyping using early vital signs has proven challenging, as vital signs are typically sampled sporadically. We proposed a novel, deep temporal interpolation and clustering network to simultaneously extract latent representations from irregularly sampled vital signs and derive phenotypes. Four distinct clusters were identified. Phenotype A (18%) had the greatest prevalence of comorbid disease with increased prevalence of prolonged respiratory insufficiency, acute kidney injury, sepsis, and long-term (3-year) mortality. Phenotypes B (33%) and C (31%) had a diffuse pattern of mild organ dysfunction. Phenotype B's favorable short-term clinical outcomes were tempered by the second highest rate of long-term mortality. Phenotype C had favorable clinical outcomes. Phenotype D (17%) exhibited early and persistent hypotension, high incidence of early surgery, and substantial biomarker incidence of inflammation. Despite early and severe illness, phenotype D had the second lowest long-term mortality. After comparing the sequential organ failure assessment scores, the clustering results did not simply provide a recapitulation of previous acuity assessments. This tool may impact triage decisions and have significant implications for clinical decision-support under time constraints and uncertainty.


Subject(s)
Organ Dysfunction Scores , Sepsis , Humans , Acute Disease , Phenotype , Biomarkers , Cluster Analysis
2.
Am J Surg ; 232: 45-53, 2024 Jun.
Article in English | MEDLINE | ID: mdl-38383166

ABSTRACT

BACKGROUND: There is no consensus regarding safe intraoperative blood pressure thresholds that protect against postoperative acute kidney injury (AKI). This review aims to examine the existing literature to delineate safe intraoperative hypotension (IOH) parameters to prevent postoperative AKI. METHODS: PubMed, Cochrane Central, and Web of Science were systematically searched for articles published between 2015 and 2022 relating the effects of IOH on postoperative AKI. RESULTS: Our search yielded 19 articles. IOH risk thresholds ranged from <50 to <75 â€‹mmHg for mean arterial pressure (MAP) and from <70 to <100 â€‹mmHg for systolic blood pressure (SBP). MAP below 65 â€‹mmHg for over 5 â€‹min was the most cited threshold (N â€‹= â€‹13) consistently associated with increased postoperative AKI. Greater magnitude and duration of MAP and SBP below the thresholds were generally associated with a dose-dependent increase in postoperative AKI incidence. CONCLUSIONS: While a consistent definition for IOH remains elusive, the evidence suggests that MAP below 65 â€‹mmHg for over 5 â€‹min is strongly associated with postoperative AKI, with the risk increasing with the magnitude and duration of IOH.


Subject(s)
Acute Kidney Injury , Hypotension , Intraoperative Complications , Postoperative Complications , Humans , Acute Kidney Injury/etiology , Acute Kidney Injury/epidemiology , Acute Kidney Injury/prevention & control , Hypotension/etiology , Hypotension/epidemiology , Hypotension/prevention & control , Postoperative Complications/epidemiology , Postoperative Complications/prevention & control , Postoperative Complications/etiology , Intraoperative Complications/prevention & control , Intraoperative Complications/epidemiology , Intraoperative Complications/etiology
3.
Sci Rep ; 13(1): 17781, 2023 10 18.
Article in English | MEDLINE | ID: mdl-37853103

ABSTRACT

Persistence of acute kidney injury (AKI) or insufficient recovery of renal function was associated with reduced long-term survival and life quality. We quantified AKI trajectories and describe transitions through progression and recovery among hospitalized patients. 245,663 encounters from 128,271 patients admitted to UF Health between 2012 and 2019 were retrospectively categorized according to the worst AKI stage experienced within 24-h periods. Multistate models were fit for describing characteristics influencing transitions towards progressed or regressed AKI, discharge, and death. Effects of age, sex, race, admission comorbidities, and prolonged intensive care unit stay (ICU) on transition rates were examined via Cox proportional hazards models. About 20% of encounters had AKI; where 66% of those with AKI had Stage 1 as their worst AKI severity during hospitalization, 18% had Stage 2, and 16% had Stage 3 AKI (12% with kidney replacement therapy (KRT) and 4% without KRT). At 3 days following Stage 1 AKI, 71.1% (70.5-71.6%) were either resolved to No AKI or discharged, while recovery proportion was 38% (37.4-38.6%) and discharge proportion was 7.1% (6.9-7.3%) following AKI Stage 2. At 14 days following Stage 1 AKI, patients with additional frail conditions stay had lower transition proportion towards No AKI or discharge states. Multistate modeling framework is a facilitating mechanism for understanding AKI clinical course and examining characteristics influencing disease process and transition rates.


Subject(s)
Acute Kidney Injury , Intensive Care Units , Humans , Retrospective Studies , Acute Kidney Injury/epidemiology , Acute Kidney Injury/therapy , Renal Replacement Therapy , Disease Progression , Risk Factors
4.
JMIR Med Inform ; 11: e48297, 2023 Aug 24.
Article in English | MEDLINE | ID: mdl-37646309

ABSTRACT

Background: Machine learning-enabled clinical information systems (ML-CISs) have the potential to drive health care delivery and research. The Fast Healthcare Interoperability Resources (FHIR) data standard has been increasingly applied in developing these systems. However, methods for applying FHIR to ML-CISs are variable. Objective: This study evaluates and compares the functionalities, strengths, and weaknesses of existing systems and proposes guidelines for optimizing future work with ML-CISs. Methods: Embase, PubMed, and Web of Science were searched for articles describing machine learning systems that were used for clinical data analytics or decision support in compliance with FHIR standards. Information regarding each system's functionality, data sources, formats, security, performance, resource requirements, scalability, strengths, and limitations was compared across systems. Results: A total of 39 articles describing FHIR-based ML-CISs were divided into the following three categories according to their primary focus: clinical decision support systems (n=18), data management and analytic platforms (n=10), or auxiliary modules and application programming interfaces (n=11). Model strengths included novel use of cloud systems, Bayesian networks, visualization strategies, and techniques for translating unstructured or free-text data to FHIR frameworks. Many intelligent systems lacked electronic health record interoperability and externally validated evidence of clinical efficacy. Conclusions: Shortcomings in current ML-CISs can be addressed by incorporating modular and interoperable data management, analytic platforms, secure interinstitutional data exchange, and application programming interfaces with adequate scalability to support both real-time and prospective clinical applications that use electronic health record platforms with diverse implementations.

5.
Surgery ; 174(3): 709-714, 2023 09.
Article in English | MEDLINE | ID: mdl-37316372

ABSTRACT

BACKGROUND: Acute kidney injury is a common postoperative complication affecting between 10% and 30% of surgical patients. Acute kidney injury is associated with increased resource usage and chronic kidney disease development, with more severe acute kidney injury suggesting more aggressive deterioration in clinical outcomes and mortality. METHODS: We considered 42,906 surgical patients admitted to University of Florida Health (n = 51,806) between 2014 and 2021. Acute kidney injury stages were determined using the Kidney Disease Improving Global Outcomes serum creatinine criteria. We developed a recurrent neural network-based model to continuously predict acute kidney injury risk and state in the following 24 hours and compared it with logistic regression, random forest, and multi-layer perceptron models. We used medications, laboratory and vital measurements, and derived features from past one-year records as inputs. We analyzed the proposed model with integrated gradients for enhanced explainability. RESULTS: Postoperative acute kidney injury at any stage developed in 20% (10,664) of the cohort. The recurrent neural network model was more accurate in predicting nearly all categories of next-day acute kidney injury stages (including the no acute kidney injury group). The area under the receiver operating curve and 95% confidence intervals for recurrent neural network and logistic regression models were for no acute kidney injury (0.98 [0.98-0.98] vs 0.93 [0.93-0.93]), stage 1 (0.95 [0.95-0.95] vs. 0.81 [0.80-0.82]), stage 2/3 (0.99 [0.99-0.99] vs 0.96 [0.96-0.97]), and stage 3 with renal replacement therapy (1.0 [1.0-1.0] vs 1.0 [1.0-1.0]. CONCLUSION: The proposed model demonstrates that temporal processing of patient information can lead to more granular and dynamic modeling of acute kidney injury status and result in more continuous and accurate acute kidney injury prediction. We showcase the integrated gradients framework's utility as a mechanism for enhancing model explainability, potentially facilitating clinical trust for future implementation.


Subject(s)
Acute Kidney Injury , Deep Learning , Humans , Acute Kidney Injury/diagnosis , Acute Kidney Injury/epidemiology , Acute Kidney Injury/etiology , Logistic Models , Forecasting , Kidney
6.
ArXiv ; 2023 Mar 08.
Article in English | MEDLINE | ID: mdl-36945689

ABSTRACT

OBJECTIVES: We aim to quantify longitudinal acute kidney injury (AKI) trajectories and to describe transitions through progressing and recovery states and outcomes among hospitalized patients using multistate models. METHODS: In this large, longitudinal cohort study, 138,449 adult patients admitted to a quaternary care hospital between 2012 and 2019 were staged based on Kidney Disease: Improving Global Outcomes serum creatinine criteria for the first 14 days of their hospital stay. We fit multistate models to estimate probability of being in a certain clinical state at a given time after entering each one of the AKI stages. We investigated the effects of selected variables on transition rates via Cox proportional hazards regression models. RESULTS: Twenty percent of hospitalized encounters (49,325/246,964) had AKI; among patients with AKI, 66% had Stage 1 AKI, 18% had Stage 2 AKI, and 17% had AKI Stage 3 with or without RRT. At seven days following Stage 1 AKI, 69% (95% confidence interval [CI]: 68.8%-70.5%) were either resolved to No AKI or discharged, while smaller proportions of recovery (26.8%, 95% CI: 26.1%-27.5%) and discharge (17.4%, 95% CI: 16.8%-18.0%) were observed following AKI Stage 2. At 14 days following Stage 1 AKI, patients with more frail conditions (Charlson comorbidity index greater than or equal to 3 and had prolonged ICU stay) had lower proportion of transitioning to No AKI or discharge states. DISCUSSION: Multistate analyses showed that the majority of Stage 2 and higher severity AKI patients could not resolve within seven days; therefore, strategies preventing the persistence or progression of AKI would contribute to the patients' life quality. CONCLUSIONS: We demonstrate multistate modeling framework's utility as a mechanism for a better understanding of the clinical course of AKI with the potential to facilitate treatment and resource planning.

7.
J Trauma Acute Care Surg ; 94(6): 814-822, 2023 06 01.
Article in English | MEDLINE | ID: mdl-36727772

ABSTRACT

BACKGROUND: In traumatic hemorrhage, hybrid operating rooms offer near simultaneous performance of endovascular and open techniques, with correlations to earlier hemorrhage control, fewer transfusions, and possible decreased mortality. However, hybrid operating rooms are resource intensive. This study quantifies and describes a single-center experience with the complications, cost-utility, and value of a dedicated trauma hybrid operating room. METHODS: This retrospective cohort study evaluated 292 consecutive adult trauma patients who underwent immediate (<4 hours) operative intervention at a Level I trauma center. A total of 106 patients treated before the construction of a hybrid operating room served as historical controls to the 186 patients treated thereafter. Demographics, hemorrhage-control procedures, and financial data as well as postoperative complications and outcomes were collected via electronic medical records. Value and incremental cost-utility ratio were calculated. RESULTS: Demographics and severity of illness were similar between cohorts. Resuscitative endovascular occlusion of the aorta was more frequently used in the hybrid operating room. Hemorrhage control occurred faster (60 vs. 49 minutes, p = 0.005) and, in the 4- to 24-hour postadmission period, required less red blood cell (mean, 1.0 vs. 0 U, p = 0.001) and plasma (mean, 1.0 vs. 0 U, p < 0.001) transfusions. Complications were similar except for a significant decrease in pneumonia (7% vs. 4%, p = 0.008). Severe complications (Clavien-Dindo classification, ≥3) were similar. Across the patient admission, costs were not significantly different ($50,023 vs. $54,740, p = 0.637). There was no change in overall value (1.00 vs. 1.07, p = 0.778). CONCLUSION: The conversion of our standard trauma operating room to an endovascular hybrid operating room provided measurable improvements in hemorrhage control, red blood cell and plasma transfusions, and postoperative pneumonia without significant increase in cost. Value was unchanged. LEVEL OF EVIDENCE: Economic/Value-Based Evaluations; Level III.


Subject(s)
Endovascular Procedures , Operating Rooms , Adult , Humans , Retrospective Studies , Hemorrhage/etiology , Hemorrhage/therapy , Resuscitation/methods , Blood Transfusion , Endovascular Procedures/methods , Trauma Centers
8.
J Am Coll Surg ; 236(2): 279-291, 2023 02 01.
Article in English | MEDLINE | ID: mdl-36648256

ABSTRACT

BACKGROUND: In single-institution studies, overtriaging low-risk postoperative patients to ICUs has been associated with a low value of care; undertriaging high-risk postoperative patients to general wards has been associated with increased mortality and morbidity. This study tested the reproducibility of an automated postoperative triage classification system to generating an actionable, explainable decision support system. STUDY DESIGN: This longitudinal cohort study included adults undergoing inpatient surgery at two university hospitals. Triage classifications were generated by an explainable deep learning model using preoperative and intraoperative electronic health record features. Nearest neighbor algorithms identified risk-matched controls. Primary outcomes were mortality, morbidity, and value of care (inverted risk-adjusted mortality/total direct costs). RESULTS: Among 4,669 ICU admissions, 237 (5.1%) were overtriaged. Compared with 1,021 control ward admissions, overtriaged admissions had similar outcomes but higher costs ($15.9K [interquartile range $9.8K to $22.3K] vs $10.7K [$7.0K to $17.6K], p < 0.001) and lower value of care (0.2 [0.1 to 0.3] vs 1.5 [0.9 to 2.2], p < 0.001). Among 8,594 ward admissions, 1,029 (12.0%) were undertriaged. Compared with 2,498 control ICU admissions, undertriaged admissions had longer hospital length-of-stays (6.4 [3.4 to 12.4] vs 5.4 [2.6 to 10.4] days, p < 0.001); greater incidence of hospital mortality (1.7% vs 0.7%, p = 0.03), cardiac arrest (1.4% vs 0.5%, p = 0.04), and persistent acute kidney injury without renal recovery (5.2% vs 2.8%, p = 0.002); similar costs ($21.8K [$13.3K to $34.9K] vs $21.9K [$13.1K to $36.3K]); and lower value of care (0.8 [0.5 to 1.3] vs 1.2 [0.7 to 2.0], p < 0.001). CONCLUSIONS: Overtriage was associated with low value of care; undertriage was associated with both low value of care and increased mortality and morbidity. The proposed framework for generating automated postoperative triage classifications is reproducible.


Subject(s)
Deep Learning , Adult , Humans , Longitudinal Studies , Reproducibility of Results , Triage , Cohort Studies , Retrospective Studies
9.
Physiol Meas ; 44(2)2023 02 09.
Article in English | MEDLINE | ID: mdl-36657179

ABSTRACT

Objective. In 2019, the University of Florida College of Medicine launched theMySurgeryRiskalgorithm to predict eight major post-operative complications using automatically extracted data from the electronic health record.Approach. This project was developed in parallel with our Intelligent Critical Care Center and represents a culmination of efforts to build an efficient and accurate model for data processing and predictive analytics.Main Results and Significance. This paper discusses how our model was constructed and improved upon. We highlight the consolidation of the database, processing of fixed and time-series physiologic measurements, development and training of predictive models, and expansion of those models into different aspects of patient assessment and treatment. We end by discussing future directions of the model.


Subject(s)
Electronic Health Records , Machine Learning , Humans , Forecasting
10.
Crit Care Explor ; 5(1): e0848, 2023 Jan.
Article in English | MEDLINE | ID: mdl-36699252

ABSTRACT

To evaluate the methodologic rigor and predictive performance of models predicting ICU readmission; to understand the characteristics of ideal prediction models; and to elucidate relationships between appropriate triage decisions and patient outcomes. DATA SOURCES: PubMed, Web of Science, Cochrane, and Embase. STUDY SELECTION: Primary literature that reported the development or validation of ICU readmission prediction models within from 2010 to 2021. DATA EXTRACTION: Relevant study information was extracted independently by two authors using the Critical Appraisal and Data Extraction for Systematic Reviews of Prediction Modelling Studies checklist. Bias was evaluated using the Prediction model Risk Of Bias ASsessment Tool. Data sources, modeling methodology, definition of outcomes, performance, and risk of bias were critically evaluated to elucidate relevant relationships. DATA SYNTHESIS: Thirty-three articles describing models were included. Six studies had a high overall risk of bias due to improper inclusion criteria or omission of critical analysis details. Four other studies had an unclear overall risk of bias due to lack of detail describing the analysis. Overall, the most common (50% of studies) source of bias was the filtering of candidate predictors via univariate analysis. The poorest performing models used existing clinical risk or acuity scores such as Acute Physiologic Assessment and Chronic Health Evaluation II, Sequential Organ Failure Assessment, or Stability and Workload Index for Transfer as the sole predictor. The higher-performing ICU readmission prediction models used homogenous patient populations, specifically defined outcomes, and routinely collected predictors that were analyzed over time. CONCLUSIONS: Models predicting ICU readmission can achieve performance advantages by using longitudinal time series modeling, homogenous patient populations, and predictor variables tailored to those populations.

11.
Ann Surg ; 277(2): 179-185, 2023 02 01.
Article in English | MEDLINE | ID: mdl-35797553

ABSTRACT

OBJECTIVE: We test the hypothesis that for low-acuity surgical patients, postoperative intensive care unit (ICU) admission is associated with lower value of care compared with ward admission. BACKGROUND: Overtriaging low-acuity patients to ICU consumes valuable resources and may not confer better patient outcomes. Associations among postoperative overtriage, patient outcomes, costs, and value of care have not been previously reported. METHODS: In this longitudinal cohort study, postoperative ICU admissions were classified as overtriaged or appropriately triaged according to machine learning-based patient acuity assessments and requirements for immediate postoperative mechanical ventilation or vasopressor support. The nearest neighbors algorithm identified risk-matched control ward admissions. The primary outcome was value of care, calculated as inverse observed-to-expected mortality ratios divided by total costs. RESULTS: Acuity assessments had an area under the receiver operating characteristic curve of 0.92 in generating predictions for triage classifications. Of 8592 postoperative ICU admissions, 423 (4.9%) were overtriaged. These were matched with 2155 control ward admissions with similar comorbidities, incidence of emergent surgery, immediate postoperative vital signs, and do not resuscitate order placement and rescindment patterns. Compared with controls, overtraiged admissions did not have a lower incidence of any measured complications. Total costs for admission were $16.4K for overtriage and $15.9K for controls ( P =0.03). Value of care was lower for overtriaged admissions [2.9 (2.0-4.0)] compared with controls [24.2 (14.1-34.5), P <0.001]. CONCLUSIONS: Low-acuity postoperative patients who were overtriaged to ICUs had increased total costs, no improvements in outcomes, and received low-value care.


Subject(s)
Hospitalization , Intensive Care Units , Humans , Longitudinal Studies , Retrospective Studies , Cohort Studies
12.
Article in English | MEDLINE | ID: mdl-36532301

ABSTRACT

Established guidelines describe minimum requirements for reporting algorithms in healthcare; it is equally important to objectify the characteristics of ideal algorithms that confer maximum potential benefits to patients, clinicians, and investigators. We propose a framework for ideal algorithms, including 6 desiderata: explainable (convey the relative importance of features in determining outputs), dynamic (capture temporal changes in physiologic signals and clinical events), precise (use high-resolution, multimodal data and aptly complex architecture), autonomous (learn with minimal supervision and execute without human input), fair (evaluate and mitigate implicit bias and social inequity), and reproducible (validated externally and prospectively and shared with academic communities). We present an ideal algorithms checklist and apply it to highly cited algorithms. Strategies and tools such as the predictive, descriptive, relevant (PDR) framework, the Standard Protocol Items: Recommendations for Interventional Trials-Artificial Intelligence (SPIRIT-AI) extension, sparse regression methods, and minimizing concept drift can help healthcare algorithms achieve these objectives, toward ideal algorithms in healthcare.

13.
Digit Health ; 8: 20552076221134455, 2022.
Article in English | MEDLINE | ID: mdl-36325438

ABSTRACT

Generalizability, external validity, and reproducibility are high priorities for artificial intelligence applications in healthcare. Traditional approaches to addressing these elements involve sharing patient data between institutions or practice settings, which can compromise data privacy (individuals' right to prevent the sharing and disclosure of information about themselves) and data security (simultaneously preserving confidentiality, accuracy, fidelity, and availability of data). This article describes insights from real-world implementation of federated learning techniques that offer opportunities to maintain both data privacy and availability via collaborative machine learning that shares knowledge, not data. Local models are trained separately on local data. As they train, they send local model updates (e.g. coefficients or gradients) for consolidation into a global model. In some use cases, global models outperform local models on new, previously unseen local datasets, suggesting that collaborative learning from a greater number of examples, including a greater number of rare cases, may improve predictive performance. Even when sharing model updates rather than data, privacy leakage can occur when adversaries perform property or membership inference attacks which can be used to ascertain information about the training set. Emerging techniques mitigate risk from adversarial attacks, allowing investigators to maintain both data privacy and availability in collaborative healthcare research. When data heterogeneity between participating centers is high, personalized algorithms may offer greater generalizability by improving performance on data from centers with proportionately smaller training sample sizes. Properly applied, federated learning has the potential to optimize the reproducibility and performance of collaborative learning while preserving data security and privacy.

14.
J Am Med Inform Assoc ; 29(12): 2105-2109, 2022 11 14.
Article in English | MEDLINE | ID: mdl-36305781

ABSTRACT

Healthcare systems are hampered by incomplete and fragmented patient health records. Record linkage is widely accepted as a solution to improve the quality and completeness of patient records. However, there does not exist a systematic approach for manually reviewing patient records to create gold standard record linkage data sets. We propose a robust framework for creating and evaluating manually reviewed gold standard data sets for measuring the performance of patient matching algorithms. Our 8-point approach covers data preprocessing, blocking, record adjudication, linkage evaluation, and reviewer characteristics. This framework can help record linkage method developers provide necessary transparency when creating and validating gold standard reference matching data sets. In turn, this transparency will support both the internal and external validity of recording linkage studies and improve the robustness of new record linkage strategies.


Subject(s)
Health Records, Personal , Medical Record Linkage , Humans , Medical Record Linkage/methods , Algorithms , Information Storage and Retrieval , Data Collection
15.
JAMA Netw Open ; 5(5): e2211973, 2022 05 02.
Article in English | MEDLINE | ID: mdl-35576007

ABSTRACT

Importance: Predicting postoperative complications has the potential to inform shared decisions regarding the appropriateness of surgical procedures, targeted risk-reduction strategies, and postoperative resource use. Realizing these advantages requires that accurate real-time predictions be integrated with clinical and digital workflows; artificial intelligence predictive analytic platforms using automated electronic health record (EHR) data inputs offer an intriguing possibility for achieving this, but there is a lack of high-level evidence from prospective studies supporting their use. Objective: To examine whether the MySurgeryRisk artificial intelligence system has stable predictive performance between development and prospective validation phases and whether it is feasible to provide automated outputs directly to surgeons' mobile devices. Design, Setting, and Participants: In this prognostic study, the platform used automated EHR data inputs and machine learning algorithms to predict postoperative complications and provide predictions to surgeons, previously through a web portal and currently through a mobile device application. All patients 18 years or older who were admitted for any type of inpatient surgical procedure (74 417 total procedures involving 58 236 patients) between June 1, 2014, and September 20, 2020, were included. Models were developed using retrospective data from 52 117 inpatient surgical procedures performed between June 1, 2014, and November 27, 2018. Validation was performed using data from 22 300 inpatient surgical procedures collected prospectively from November 28, 2018, to September 20, 2020. Main Outcomes and Measures: Algorithms for generalized additive models and random forest models were developed and validated using real-time EHR data. Model predictive performance was evaluated primarily using area under the receiver operating characteristic curve (AUROC) values. Results: Among 58 236 total adult patients who received 74 417 major inpatient surgical procedures, the mean (SD) age was 57 (17) years; 29 226 patients (50.2%) were male. Results reported in this article focus primarily on the validation cohort. The validation cohort included 22 300 inpatient surgical procedures involving 19 132 patients (mean [SD] age, 58 [17] years; 9672 [50.6%] male). A total of 2765 patients (14.5%) were Black or African American, 14 777 (77.2%) were White, 1235 (6.5%) were of other races (including American Indian or Alaska Native, Asian, Native Hawaiian or Pacific Islander, and multiracial), and 355 (1.9%) were of unknown race because of missing data; 979 patients (5.1%) were Hispanic, 17 663 (92.3%) were non-Hispanic, and 490 (2.6%) were of unknown ethnicity because of missing data. A greater number of input features was associated with stable or improved model performance. For example, the random forest model trained with 135 input features had the highest AUROC values for predicting acute kidney injury (0.82; 95% CI, 0.82-0.83); cardiovascular complications (0.81; 95% CI, 0.81-0.82); neurological complications, including delirium (0.87; 95% CI, 0.87-0.88); prolonged intensive care unit stay (0.89; 95% CI, 0.88-0.89); prolonged mechanical ventilation (0.91; 95% CI, 0.90-0.91); sepsis (0.86; 95% CI, 0.85-0.87); venous thromboembolism (0.82; 95% CI, 0.81-0.83); wound complications (0.78; 95% CI, 0.78-0.79); 30-day mortality (0.84; 95% CI, 0.82-0.86); and 90-day mortality (0.84; 95% CI, 0.82-0.85), with accuracy similar to surgeons' predictions. Compared with the original web portal, the mobile device application allowed efficient fingerprint login access and loaded data approximately 10 times faster. The application output displayed patient information, risk of postoperative complications, top 3 risk factors for each complication, and patterns of complications for individual surgeons compared with their colleagues. Conclusions and Relevance: In this study, automated real-time predictions of postoperative complications with mobile device outputs had good performance in clinical settings with prospective validation, matching surgeons' predictive accuracy.


Subject(s)
Artificial Intelligence , Electronic Health Records , Adult , Algorithms , Female , Humans , Machine Learning , Male , Middle Aged , Postoperative Complications/epidemiology , Postoperative Complications/etiology , Prospective Studies , Retrospective Studies
16.
Article in English | MEDLINE | ID: mdl-36590140

ABSTRACT

Mistrust is a major barrier to implementing deep learning in healthcare settings. Entrustment could be earned by conveying model certainty, or the probability that a given model output is accurate, but the use of uncertainty estimation for deep learning entrustment is largely unexplored, and there is no consensus regarding optimal methods for quantifying uncertainty. Our purpose is to critically evaluate methods for quantifying uncertainty in deep learning for healthcare applications and propose a conceptual framework for specifying certainty of deep learning predictions. We searched Embase, MEDLINE, and PubMed databases for articles relevant to study objectives, complying with PRISMA guidelines, rated study quality using validated tools, and extracted data according to modified CHARMS criteria. Among 30 included studies, 24 described medical imaging applications. All imaging model architectures used convolutional neural networks or a variation thereof. The predominant method for quantifying uncertainty was Monte Carlo dropout, producing predictions from multiple networks for which different neurons have dropped out and measuring variance across the distribution of resulting predictions. Conformal prediction offered similar strong performance in estimating uncertainty, along with ease of interpretation and application not only to deep learning but also to other machine learning approaches. Among the six articles describing non-imaging applications, model architectures and uncertainty estimation methods were heterogeneous, but predictive performance was generally strong, and uncertainty estimation was effective in comparing modeling methods. Overall, the use of model learning curves to quantify epistemic uncertainty (attributable to model parameters) was sparse. Heterogeneity in reporting methods precluded the performance of a meta-analysis. Uncertainty estimation methods have the potential to identify rare but important misclassifications made by deep learning models and compare modeling methods, which could build patient and clinician trust in deep learning applications in healthcare. Efficient maturation of this field will require standardized guidelines for reporting performance and uncertainty metrics.

17.
PLOS Digit Health ; 1(10)2022.
Article in English | MEDLINE | ID: mdl-36590701

ABSTRACT

During the early stages of hospital admission, clinicians use limited information to make decisions as patient acuity evolves. We hypothesized that clustering analysis of vital signs measured within six hours of hospital admission would reveal distinct patient phenotypes with unique pathophysiological signatures and clinical outcomes. We created a longitudinal electronic health record dataset for 75,762 adult patient admissions to a tertiary care center in 2014-2016 lasting six hours or longer. Physiotypes were derived via unsupervised machine learning in a training cohort of 41,502 patients applying consensus k-means clustering to six vital signs measured within six hours of admission. Reproducibility and correlation with clinical biomarkers and outcomes were assessed in validation cohort of 17,415 patients and testing cohort of 16,845 patients. Training, validation, and testing cohorts had similar age (54-55 years) and sex (55% female), distributions. There were four distinct clusters. Physiotype A had physiologic signals consistent with early vasoplegia, hypothermia, and low-grade inflammation and favorable short-and long-term clinical outcomes despite early, severe illness. Physiotype B exhibited early tachycardia, tachypnea, and hypoxemia followed by the highest incidence of prolonged respiratory insufficiency, sepsis, acute kidney injury, and short- and long-term mortality. Physiotype C had minimal early physiological derangement and favorable clinical outcomes. Physiotype D had the greatest prevalence of chronic cardiovascular and kidney disease, presented with severely elevated blood pressure, and had good short-term outcomes but suffered increased 3-year mortality. Comparing sequential organ failure assessment (SOFA) scores across physiotypes demonstrated that clustering did not simply recapitulate previously established acuity assessments. In a heterogeneous cohort of hospitalized patients, unsupervised machine learning techniques applied to routine, early vital sign data identified physiotypes with unique disease categories and distinct clinical outcomes. This approach has the potential to augment understanding of pathophysiology by distilling thousands of disease states into a few physiological signatures.

18.
Ann Surg ; 275(2): 332-339, 2022 02 01.
Article in English | MEDLINE | ID: mdl-34261886

ABSTRACT

OBJECTIVE: Develop unifying definitions and paradigms for data-driven methods to augment postoperative resource intensity decisions. SUMMARY BACKGROUND DATA: Postoperative level-of-care assignments and frequency of vital sign and laboratory measurements (ie, resource intensity) should align with patient acuity. Effective, data-driven decision-support platforms could improve value of care for millions of patients annually, but their development is hindered by the lack of salient definitions and paradigms. METHODS: Embase, PubMed, and Web of Science were searched for articles describing patient acuity and resource intensity after inpatient surgery. Study quality was assessed using validated tools. Thirty-five studies were included and assimilated according to PRISMA guidelines. RESULTS: Perioperative patient acuity is accurately represented by combinations of demographic, physiologic, and hospital-system variables as input features in models that capture complex, non-linear relationships. Intraoperative physiologic data enriche these representations. Triaging high-acuity patients to low-intensity care is associated with increased risk for mortality; triaging low-acuity patients to intensive care units (ICUs) has low value and imparts harm when other, valid requests for ICU admission are denied due to resource limitations, increasing their risk for unrecognized decompensation and failure-to-rescue. Providing high-intensity care for low-acuity patients may also confer harm through unnecessary testing and subsequent treatment of incidental findings, but there is insufficient evidence to evaluate this hypothesis. Compared with data-driven models, clinicians exhibit volatile performance in predicting complications and making postoperative resource intensity decisions. CONCLUSION: To optimize value, postoperative resource intensity decisions should align with precise, data-driven patient acuity assessments augmented by models that accurately represent complex, non-linear relationships among risk factors.


Subject(s)
Health Resources , Patient Acuity , Surgical Procedures, Operative , Humans , Postoperative Period
19.
JAMA Netw Open ; 4(11): e2131669, 2021 11 01.
Article in English | MEDLINE | ID: mdl-34757412

ABSTRACT

Importance: Undertriaging patients who are at increased risk for postoperative complications after surgical procedures to low-acuity hospital wards (ie, floors) rather than highly vigilant intensive care units (ICUs) may be associated with risk of unrecognized decompensation and worse patient outcomes, but evidence for these associations is lacking. Objective: To test the hypothesis that postoperative undertriage is associated with increased mortality and morbidity compared with risk-matched ICU admission. Design, Setting, and Participants: This longitudinal cross-sectional study was conducted using data from the University of Florida Integrated Data Repository on admissions to a university hospital. Included patients were individuals aged 18 years or older who were admitted after a surgical procedure from June 1, 2014, to August 20, 2020. Data were analyzed from April through August 2021. Exposures: Ward admissions were considered undertriaged if their estimated risk for hospital mortality or prolonged ICU stay (ie, ≥48 hours) was in the top quartile among all inpatient surgical procedures according to a validated machine-learning model using preoperative and intraoperative electronic health record features available at surgical procedure end time. A nearest neighbors algorithm was used to identify a risk-matched control group of ICU admissions. Main Outcomes and Measures: The primary outcomes of hospital mortality and morbidity were compared among appropriately triaged ward admissions, undertriaged wards admissions, and a risk-matched control group of ICU admissions. Results: Among 12 348 postoperative ward admissions, 11 042 admissions (89.4%) were appropriately triaged (5927 [53.7%] women; median [IQR] age, 59 [44-70] years) and 1306 admissions (10.6%) were undertriaged and matched with a control group of 2452 ICU admissions. The undertriaged group, compared with the control group, had increased median [IQR] age (64 [54-74] years vs 62 [50-73] years; P = .001) and increased proportions of women (649 [49.7%] women vs 1080 [44.0%] women; P < .001) and admitted patients with do not resuscitate orders before first surgical procedure (53 admissions [4.1%] vs 27 admissions [1.1%]); P < .001); 207 admissions that were undertriaged (15.8%) had subsequent ICU admission. In the validation cohort, hospital mortality and prolonged ICU stay estimations had areas under the receiver operating characteristic curve of 0.92 (95% CI, 0.91-0.93) and 0.92 (95% CI, 0.92-0.92), respectively. The undertriaged group, compared with the control group, had similar incidence of prolonged mechanical ventilation (32 admissions [2.5%] vs 53 admissions [2.2%]; P = .60), decreased median (IQR) total costs for admission ($26 900 [$18 400-$42 300] vs $32 700 [$22 700-$48 500]; P < .001), increased median (IQR) hospital length of stay (8.1 [5.1-13.6] days vs 6.0 [3.3-9.3] days, P < .001), and increased incidence of hospital mortality (19 admissions [1.5%] vs 17 admissions [0.7%]; P = .04), discharge to hospice (23 admissions [1.8%] vs 14 admissions [0.6%]; P < .001), unplanned intubation (45 admissions [3.4%] vs 49 admissions [2.0%]; P = .01), and acute kidney injury (341 admissions [26.1%] vs 477 admissions [19.5%]; P < .001). Conclusions and Relevance: This study found that admitted patients at increased risk for postoperative complications who were undertriaged to hospital wards had increased mortality and morbidity compared with a risk-matched control group of admissions to ICUs. Postoperative undertriage was identifiable using automated preoperative and intraoperative data as features in real-time machine-learning models.


Subject(s)
Hospital Mortality , Intensive Care Units/statistics & numerical data , Postoperative Care/methods , Postoperative Care/statistics & numerical data , Postoperative Complications/epidemiology , Triage/statistics & numerical data , Adult , Aged , Aged, 80 and over , Cross-Sectional Studies , Female , Florida/epidemiology , Hospitals, University , Humans , Length of Stay/economics , Length of Stay/statistics & numerical data , Longitudinal Studies , Male , Middle Aged , Patients' Rooms , Postoperative Complications/economics , Risk Factors , Triage/methods
20.
Curr Opin Crit Care ; 27(6): 560-572, 2021 12 01.
Article in English | MEDLINE | ID: mdl-34757993

ABSTRACT

PURPOSE OF REVIEW: Acute kidney injury (AKI) affects nearly 60% of all patients admitted to ICUs. Large volumes of clinical, monitoring and laboratory data produced in ICUs allow the application of artificial intelligence analytics. The purpose of this article is to assimilate and critically evaluate recently published literature regarding artificial intelligence applications for predicting, diagnosing and subphenotyping AKI among critically ill patients. RECENT FINDINGS: Among recent studies regarding artificial intelligence implementations for predicting, diagnosing and subphenotyping AKI among critically ill patients, there are many promising models, but few had external validation, clinical interpretability and high predictive performance. Deep learning techniques leveraging multimodal clinical data show great potential to provide continuous, accurate, early predictions of AKI risk, which could be implemented clinically to optimize preventive and early therapeutic management strategies. SUMMARY: Use of consensus criteria, standard definitions and common data models could facilitate access to machine learning-ready data sets for external validation. The lack of interpretability, explainability, fairness and transparency of artificial intelligence models hinder their entrustment and clinical implementation; compliance with standardized reporting guidelines can mitigate these challenges.


Subject(s)
Acute Kidney Injury , Deep Learning , Artificial Intelligence , Critical Illness , Humans , Intensive Care Units
SELECTION OF CITATIONS
SEARCH DETAIL
...